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Correctness of program transformations in extended lambda calculi with a contextual semantics is 
usually based on reasoning about the operational semantics which is a rewrite semantics. A success- 
ful approach to proving correctness is the combination of a context lemma with the computation of 
overlaps between program transformations and the reduction rules, and then of so-called complete 
sets of diagrams. The method is similar to the computation of critical pairs for the completion of term 
rewriting systems. We explore cases where the computation of these overlaps can be done in a first 
order way by variants of critical pair computation that use unification algorithms. As a case study we 
apply the method to a lambda calculus with recursive let-expressions and describe an effective uni- 
fication algorithm to determine all overlaps of a set of transformations with all reduction rules. The 
unification algorithm employs many-sorted terms, the equational theory of left-commutativity mod- 
elling multi-sets, context variables of different kinds and a mechanism for compactly representing 
binding chains in recursive let-expressions. 

1 Introduction and Motivation 

Programming languages are often described by their syntax and their operational semantics, which in 
principle enables the implementation of an interpreter and a compiler in order to put the language into 
use. Of course, also optimizations and transformations into low-level constructs are part of the implemen- 
tation. The justification of correctness is in many cases either omitted, informal or by intuitive reasoning. 
Inherent obstacles are that programming languages are usually complex, use operational features that are 
not deterministic like parallel execution, concurrent threads, and effects like input and output, and may 
even be modified or extended in later releases. 

Here we want to pursue the approach using contextual semantics for justifying the correctness of 
optimizations and compilation and to look for methods for automating the correctness proofs of trans- 
formations and optimizations. 

We assume given the syntax of programs 'P, a deterministic reduction relation — C !P x !P that 
represents a single execution step on programs 

and values that represent the successful end of program execution. The reduction of a program may 
be non-terminating due to language constructs that allow iteration or recursive definitions. For a program 
P G !P we write f J| if there is a sequence of reductions to a value, and say P converges (or terminates 
successfully) in this case. Then equivalence of programs can be defined by P\ ~ P2 <^=^ (for all C : 
C[Pi]-IJ. <;=^ C[P2]JJ-)5 where C is a context, i.e. a program with a hole [•] at a single position. Justifying 
the correctness of a program transformation P^ P' means to provide a proof that P P'. Unfortunately, 
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the quantification is over an infinite set: tlie set of all contexts, and tlie criterion is termination, wliich is 
undecidable in general. Well-known tools to ease tlie proofs are context lemmas (91, ciu-lemmas |16i and 
bisimulation, see e.g. [7]. 

The reduction relation — )• is often given as a set of rules /, — )• r, similarly to rewriting rules, but 
extended with different kinds of meta-variables and some other constructs, together with a strategy de- 
termining when to use which rule and at which position. In order to prove correctness of a program 
transformation that is also given in a rule form — )• S2, we have to show that o{si) ~ 0(52) for all 
possible rule instantiations a i.e. C[a(si)]JJ. <;=^ C[a(5'2)]JJ- for all contexts C. Using the details of the 
reduction steps and induction on the length of reductions, the hard part is to look for conflicts between 
instantiations of si and some /,, i.e. to compute all the overlaps of /, and 5^1, and the possible completions 
under reduction and transformation. This method is reminiscent of the critical pair criterion of Knuth- 
Bendix method LBJ but has to be adapted to an asymmetric situation, to extended instantiations and to 
higher-order terms. 

In this paper we develop a unification method to compute all overlaps of left hand sides of a set of 
transformations rules and the reduction rules of the calculus L„eed which is a call-by-need lambda calculus 
with a letrec-construct (see [ 12]). We show that a custom-tailored unification algorithm can be developed 
that is decidable and produces a complete and finite set of unifiers for the required equations. The follow- 
ing expressiveness is required: Many-sorted terms in order to avoid most of the junk solutions; context 
variables which model the context meta-variables in the rule descriptions; context classes allow the uni- 
fication algorithm to treat different kinds of context meta-variables in the rules; the equational theory of 
multi-sets models the letrec-environment of bindings; Empty sorts are used to approximate scoping rules 
of higher-order terms, where, however, only the renaming can be modeled. Since the reduction rules are 
linear in the meta-variables, we finally only have to check whether the solutions produce expressions that 
satisfy the distinct variable convention. Binding Chains in letrec-expressions are a syntactic extension 
that models binding sequences of unknown length in the rules. This also permits to finitely represent 
infinitely many unifiers, and thus is indispensable for effectively computing all solutions. 

The required complete sets of diagrams can be computed from the overlaps by applying directed 
transformations and reduction rules. These can be used to prove correctness of program transformations 
by inductive methods. 

Since our case study is done for a small calculus, the demand for extending the method to other 
calculi like the extended lambda calculus in lITSl would justify further research. 

In Section|2]we present the syntax and operational semantics of a small call-by-need lambda calculus 
with a cyclic let. The normal order reduction rules and transformations are defined. In Section [3j the 
translation into extended first-order terms is explained. Section|4]contains a description of the unification 
algorithm that computes overlaps of left hand sides of rules and transformations in a finite representation. 
Finally, in Section |5] we illustrate a run of the unification algorithm by an example. 

2 A Small Extended Lambda Calculus with letrec 

In this section we introduce the syntax and semantics of a small call-by-need lambda calculus and use it 
as a case-study. Based on the definition of the small-step reduction semantics of the calculus we define 
our central semantic notion of contextual equivalence of calculi expressions and correctness of program 
transformations. We illustrate a method to prove the correctness of program transformations which uses 
a context lemma and complete sets of reduction diagrams. 
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2.1 The Call-by-Need Calculus L^eed 

We define a simple call-by-need lambda calculus Lmed which is exactly the call-by-need calculus of |[T2]| . 
Calculi that are related are in [|14i . and [H. 

The set £ of L„^gj-expressions is as follows where x,Xi are variables: 

Si,s,t£'E ::= x\{st)\{Xx.s)\{letrecxi=si,...,x„=Sf,iiit) 

We assign the names application, abstraction, or letrec-expression to the expressions {s t), {Xx.s), 
(letrec x\ = s\, . . . ,Xn = Sn in t), respectively. A group of letrec-bindings, also called environment, 
is abbreviated as Env. 

We assume that variables x, in letrec-bindings are all distinct, that letrec-expressions are identified 
up to reordering of binding-components (i.e. the binding-components can be interchanged), and that, 
for convenience, there is at least one binding. Letrec-bindings are recursive, i.e., the scope of Xj in 
(letrec x\ = s\,. . . ,Xn-i = in s„) are all expressions si with I <i <n. Free and bound variables 
in expressions and a-renamings are defined as usual. The set of free variables in t is denoted as FV{t). 
We use the distinct variable convention (DVC), i.e., all bound variables in expressions are assumed to 
be distinct, and free variables are distinct from bound variables. The reduction rules are assumed to 
implicitly a-rename bound variables in the result if necessary. 

A context C is an expression from Lneed extended by a symbol [•], the hole, such that [•] occurs exactly 
once (as sub-expression) in C. A formal definition is: 

Definition 2.1 Contexts C are defined by the following grammar: 

C £ C ::= [•] I (C s) I (s C) I (Ax.C) | (letrec xi = si, . . . ,Xn = s„ in C) | (letrec Env,x = C ins) 

Given a term t and a context C, we write C[t] for the L„gerf-expression constructed from C by plugging t 
into the hole, i.e, by replacing [•] in C by t, where this replacement is meant syntactically, i.e., a variable 
capture is permitted. Note that a-renaming of contexts is restricted. 

Definition 2.2 The unrestricted reduction rules /or the calculus L^ed are defined in Figure [7] Several 
reduction rules are denoted by their name prefix, e.g. the union of(llet-in) and (llet-e) is called (llet), the 
union of(cp-e) and (cp-in) is called (cp), the union of (llet) and (lapp) is called (III). 



(Ibeta) {{Xx.s) r) — )• (letrec x = r ins) 

(cp-in) (letrec X = s, Env in C[x]) ^ (letrec x = s, Env inC[s]) 

where s is an abstraction or a variable 
(cp-e) (letrec x = s,Env,y = C[x] in r) — t- (letrec x = s,Env,y = C[s] in r) 

where s is an abstraction or a variable 
(Uet-in) (letrec Envi in (letrec Env2 in r)) — t- (letrec Envi,Env2 in r) 
(llet-e) (letrec Envi,x = (letrec Env2 in Sx) in r) — t- (letrec Envi,Env2,x = Sx in r) 
(lapp) ((letrec Env in t) s) — )■ (letrec Env in (t s)) 



Figure 1 : Unrestricted reduction rules of L^eed (also used as transformations) 

The reduction rules of L„eerf contain different kinds of meta- variables. The meta- variables r,s,Sx,t 
denote arbitrary L„egrf-expressions. Env,Env\,Env2 represent letrec-environments and x,y denote bound 
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variables. All meta-variables can be instantiated by an L„ggrf-expression of the appropriate syntactical 
form. A reduction rule p = / — )• r is applicable to an expression e if I can be matched to e. Note that an 
expression may contain several sub-expressions that can be reduced according to the reduction rules of 
Figure [T] 

A standardizing order of reduction is the normal order reduction (see definitions below) where re- 
duction takes place only inside reduction contexts. 

Definition 2.3 Reduction contexts ^ application contexts ^ and surface contexts S are defined by the 
following grammars: 

A £ := [•] I (A 5) where s is an expression. 
R £ '.= A I letrec in A I letrec 3^1 = AijSnv in A[ji] 
I letrecyi =Ai,{yi+i = Ai+i\yi]}'^^^,Env TD.A\yn] 
S£S '■= [■] \ {S s) \ {s S) \ {letrec yi = si, . . . ,yn = s„ ±iL S) \ {letr ec Env,y = S in s) 

A sequence of bindings of the form y^+i = A,n+\ \ym],ym+2 = Am+i^m+i] , ■■■,yn= A„[y„-i] where 
the yi are distinct variables, the A,- are not the empty context and m < « is called a binding chain and 
abbreviated by {3;,+i = A,+i bi]}"=m- 

Definition 2.4 Normal order reduction (called no-reduction for short) is defined by the reduction 
rules in Figure^ 



(Ibeta) R[{Xx.s) r] — )• /?[letrec x = r Ins] 

(cp-in) letrec y = s,Env in A\y] — )• letrec y = s,Env in A[s] 

where s is an abstraction or a variable, 
(cp-e) letrec 3^1 = ^■,3^2 = ^liyij^Env in A|j2] — ^ letrec yi = s,y2 = A2[s],Env in A[j2] 
(cp-e-c) letrec yi = s,y2 = A2[3'i],{j,-+i = A,-+i[3;i]}^^2'-^"^ inA[3;„] 

letrec ji = s,y2 =A2[s],{yi+i = Ai+i[yi]}1^2^Env inA[y„] 

in the cp-e rules s is an abstraction or a variable and A2 is a non-empty context. 
(Uet-in) (letrec Envi in (letrec Env2 in r)) — )• (letrec Envi,Env2 in r) 
(Uet-e) letrec = (letrec fwi inr),Env2 inA[yi] —t- letrec = r,Envi,Env2 inA\yi] 
(Uet-e-c) letrec = (letrec Envi in r),{j,-+i = A,+ i[3',]}"^j,£'w2 inA[y„] 

letrec ji = r,Envi,{yi+i = A,-+i [y,-]}"^j,£'«V2 inA[3;„] 
(lapp) /?[((letrec Env in r) t)] /?[(letrec Env in (r t))] 



Figure 2: Normal order reduction rules of Lneed 

Note that the normal order reduction is unique. A weak head normal form in Lneed (WHNF) is defined 
as either an abstraction Xx.s, or an expression (letrec Env in Xx.s). 

The transitive closure of the reduction relation — is denoted as ^ and the transitive and refiexive 
closure of — )• is denoted as A. Respectively we use "°'^> for the transitive closure of the normal order 

no fio fc 

reduction relation, — ^ for its reflexive-transitive closure, and — ^ to indicate k normal order reduction 

no * 

steps. If for an expression t there exists a (finite) sequence of normal order reductions t — -)• t to a 
WHNF t', we say that the reduction converges and denote this as ? JJ. or as tij. if t' is not important. 
Otherwise the reduction is called divergent and we write ff^. 

The semantic foundation of our calculus L^eed is the equality of expressions defined by contextual 
equivalence. 
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Definition 2.5 (Contextual Preorder and Equivalence) Let s,t be Lneed-^^ressions. Then: 

s<ct iff VC: C[s]4^C[?]4 
S'^ct iff s<c?A?<c5 



Definition 2.6 A program transformation T C L^ge^/ x L„eed o- binary relation on Lneed-^xpressions. A 
program transformation is called correct iff T CI 

Program transformations are usually given in a format similarly to reduction rules (as in Figure[T]and 
Figure|2jl. A program transformation T is written ass —>t where s,t are meta-expressions i.e. expression 
that contain meta-variables. Here we restrict our attention for the sake of simplicity to the program 
transformations that are given by the reduction rules in Figure [T] 

An important tool to prove contextual equivalence is a context lemma (see for example |l9l, |[T3l . llT5l ). 
which allows to restrict the class of contexts that have to be considered in the definition of the contextual 
equivalence from general to ^ contexts. 

However, often 5-contexts are more appropriate for computing overlaps and closing the diagrams, 
so we will use 5-contexts instead of ^-contexts. 

Lemma 2.7 Let s,t be L„ggij-expressions and S a context of class S- {S[s]ii- =^ 5[f]JJ-) iff VC : (C[5']-IJ- ^ 
C[t]i\.); i.e. s<ct. 

Proof. A proof of this lemma when the contexts are in class ^ is in |[T3]| . Since every ^-context is also 
an 5-context, the lemma holds. □ 

T 

To prove the correctness of a transformation 5 — > ? we have to prove that s t s <c t A t <c s 
which by Definition 2.5 amounts to showing VC : C[s]ij- =^ C[t]ij- A C[f]JJ. =^ C[s]ij.. The context lemma 
yields that it is sufficient to show \/S : S[s]il- =^ S[t]]j. A S[t]]j. =^ S[s]]^. We restrict our attention here to 
S[s]il- =^ S[t]il- because S[t]ij. =^ S[s]ij. could be treated in a similar way. To prove s ~cf we assume that 

It remains to 



3(a) I 



s t and S[s]\^ holds, i.e. there is a WHNF s', such that S[s] — ^ s' (see Figure 
show that there also exists a sequence of normal order reductions from S[t] to a WHNF. This can often 

be done by induction on the length k of the given normal order reduction S[s] -^^^ s' using co mplet e 

sets of reduction diagrams. Therefore we split S[s] -^^^ s' into S[s\ So ""'^ S s' (see Figure 



3(b) I 



— - 5'[i'] A S[t\ can be closed specifying two 



Then an applicable /orfaVig diagram defines how the fork sq 
sequences of transformations such that a common expression t' is eventually reached: one starting from 
S[t] consisting only of no-reductions and one starting from 5'o consisting of some other reductions (that 



are not normal order) denoted by T' in Figure 3(c) 



S[s] 

no,/: I 



■S\t] 



S[s]^S[t] 



Ss] 

no 
So ■ 



■S[t] 



S<rt 



S[s] 



■S[t] 



T'* 



s' ■^O ^0 - 7 ^ t' So 

(a) Forking in the proof of no,k-\\^ „o,,t-ll ^ '* 

(b) Splitting tiie no-sequence (c) Application of a forking T ,* whnf 

diagram (d) Inductive proof of s <c t 



Figure 3: Sketch of the correctness proof for s t 
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A set of forking diagrams for a transformation T is complete if the set comprises an applicable 
diagram for every forking situation. If we have a complete set of forking diagrams we often can induc- 



tively construct a terminating reduction sequence for S[t] if S[s\i\- (as indicated in Figure 3(d) i. To prove 
S[t]\\. =^ S[s]ij. another complete set of diagrams called commuting diagrams is required which usually 
can be deduced from a set of forking diagrams (see fTS]). We restrict our attention to complete sets of 
forking diagrams. 

Example 2.8 Example forking diagrams are 

iSJIet-e iS;llet-e 



I 

no,Het-in I no.Uet-in 



noJlet-in\ 



Y iS,llet-e Y „ I ^ no.Uet-in 

. ^ . no,Ilet-e y 

where the dashed lines indicate existentially quantified reductions and the prefix iS marks that the trans- 
formation is not a normal order reduction (but a so called internal reduction which we also call transfor- 
mation), and occurs within a surface context. By application of the diagram a fork between a (no,llet-e) 
and the transformation (llet-in) can be closed. The forking diagrams specify two reduction sequences 
such that a common expression is eventually reached. The following reduction sequence illustrates an 
application of the above diagram: 

(letrec Envi,x = (letrec Env2 ins) in (letrec £"^3 in r)) 

no,llet-in , , r-. . \ . n 

7- (letrec £,?ivi,£,W3,.x; = (letrec £,?iV2 m 5j m rj 

iSyno,llet e^ (±etrec Envi,Envi,Env2,x = s in r) 
the last reduction is either an no-reduction ifr =A[x], otherwise it is an internal reduction 
iSJlet e^ (letrec Env\,Env2,x = s in (letrec Env-^ in r)) 
no,llet {\etrec Env\,Env2,Envi.,x = s in r) 

The square diagram covers the case, where (no, llet-in) is followed by an internal reduction. The 
triangle diagram covers the other case, where the reduction following (no,llet-in) is (no,llet-e). One can 
view the forking diagram as a description of local confluence. 

The computation of a complete set of diagrams by hand is cumbersome and error-prone. Nevertheless 
the diagram sets are essential for proving correctness of a large set of program transformations in this 
setting. For this reason we are interested in automatic computation of complete diagram sets. 

The first step in the computation of a complete set of forking diagrams for a transformation T is the 

no Tsd iS T 

determination of all forks of the form i — — • — — where red is an no-reduction and T is not a nor- 
mal order reduction (but a transformation in an 5-context). Such forks are given by overlaps between 
no-reductions and the transformation. Informally we say that red and T overlap in an expression s s 
contains a normal order redex red and a T redex (in a surface context). To find an overlap between an 
no-reduction red and a transformation T it is sufficient, by definition of the normal order reduction, to de- 
termine all surface-positions in red where a T-redex can occur. For the computation of all forks we have 



to consider only critical overlaps where an overlap does not occur at a variable position (Example 2.8 
illustrates such a critical overlap). Forks stemming from non-critical overlaps at variable positions can 
always be closed by a predefined set of standard diagrams. All (critical) overlaps between no-reductions 
and a given transformation T can be computed by a variant of critical pair computation based on unifi- 
cation. The employed unification procedure will be explained in the next section. 
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3 Encoding Expressions as Terms in a Combination of Sorted Equational 
Theories and Context 

In this section we develop a unification method to compute proper overlaps for forking diagrams. Ac- 



cording to the context lemma for surface contexts (Lemma 2.7 1 we restrict the overlaps to the trans- 
formations applied in surface contexts. A complete description of a single overlap is the unification 
equation ,] = lno,j, where Ij i is a left hand side in Figure[TJ and /„o y a left hand side in Figure[2j and 
S means a surface context. To solve these unification problems we translate the meta-expressions from 
transformations and no-reduction rules into many sorted terms with some special constructs to mirror the 
syntax of the reduction rules in the lambda calculus. The constructs are i) context variables of different 
context classes and C, ii) a left-commutative function symbol env to model that bindings in letrec- 
environments can be rearranged iii) a special construct BCh(. . .) to represent binding chains of variable 
length as they occur in no-reduction rules. 

The presented unification algorithm is applicable to terms with the mentioned extra constructs. We 
do not use the general unification combination algorithms in 1 1 1 , 2], since we only have a special theory 
LC that models multi-sets of bindings in letrec-environments of our calculus, and moreover, it is not clear 
how to adapt the general combination method to context classes and binding chains. 



3.1 Many Sorted Signatures, Terms and Contexts 

Let 5 = 5i tfJa be the disjoint union of a set of theory-sorts S\ and a set of free sorts Si- We assume 
that Exp is a sort in Si- Let Z = £i l±) £2 be a many-sorted signature of (theory- and free) function 
symbols, where every function symbol comes with a fixed arity and with a single sort-arity of the form 
f : S\ X . . . X Sn ^ Sn^i, where Si for i = I, ... ,n are the argument-sorts and 5„+i is called resulting sort. 
For every / G £, for / = 1,2 the resulting sort must be in Si- Note, however, that there may be function 
symbols / G £, that have argument-sorts from Sj, for / 7^ j. There is a set 'P'" of first-order variables 
that are 0-ary and have a fixed sort and are ranged over by x,y,z,..., perhaps with indices. We write 
if the variable x has the sort S. There is also a set of context-variables which are unary and are 
ranged over by X,Y,Z, perhaps with indices. We assume that for every sort S, there is an infinite number 
of variables of this sort, and that there is an infinite number of context variables of sort Exp — >• Exp. 
Let I/' = 1/"^ U 1^^. The set of terms 'r(5,£, 1/') is the set of terms built according to the grammar 
X I f{ti, . . . ,tn) I X{t), where sort conditions ai^e obeyed. Let Var{t) be the set of first-order variables 
that occur in t and let Var^{t) be the set of context variables that occur in t. A context C is a term in 
^{Exp^lLVJ [•], T^) such that there is exactly one occurrence of a the special hole constant [•] in the context 
and the sort at the position of the hole is Exp. 

A term s without occurrences of variables is called ground. We also allow sorts without any ground 
term, also called empty sorts, since this is required in our encoding of bound variables. The term s is 
called almost ground, if for every variable x in s, there is no function symbol in £ where the resulting 
sort is the sort of x, and hence no ground term of this sort. 

A substitution a is a mapping a : 'P' — )• 'r(5,r, '^"'), such that cj(x'^) is a term of sort S and o{X) is 
a context. As usual we extend a to terms, where every variable x in a term is replaced by o{x). 



3.2 Encoding of L„eerf -Expressions as Terms 



The sort and term structure according to the expression structure of the lambda calculus L^eed (from 
section 2.1 1 is as follows. There are the following sorts: Bind, Env, Exp, BV , for bindings, environments. 
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expressions and bound variables, respectively; where Si 
the following function symbols: 



{Env} and Si = {Bind, Exp, BV}. There are 



theory function symbols (L\ ) 


free function symbols (£2) 


emptyEnv :: Env 
env : : Bind x Env — t- Env 


Zef : : Env x E'x/? — t- Exp 
app : : Ex/? x Exp — )• fx/? 
/am : : BV x Be/? — )• Sjcp 


/j/«(i :: Sy X Exp - 
varwBV 


-7- Bind 



Note that there are free function symbols that map from Env to Exp, but there is no free function 
symbol that maps to Env. Note also that there is no function symbol with resulting sort BV, hence this is 
an empty sort, and every term of sort BV is a variable. 

It is convenient to have a notation for nested env-expressions: env* {{t\ ,...,?„,} U r) denotes the term 
env{t\,env{t2, ■ ■ ■ ,env{tm,r) . . .)), where r is not of the form env{s,t). Due to our assumptions on terms 
of sort Env and the sort of context variables, only the constant emptyEnv and variables are possible for r. 

As an example the expression (letrec x = Xy.y,z = x x in z) is encoded as 
let{env*{{bind{x,lam(y,var{y))),bind{z,app{var{x),var{z)))} L) empty Env), var{z.)), where x,y,z are 
variables of sort BV. 

To model the multi-set property of letrec-environments, i.e., that bindings can be reordered, 
we use the equational theory left-commutativity (LC) with the following axiom: env{x,env{y,z)) = 
env{y,env{x,z)) (for the LC-theory and unification modulo LC see f5^, '45). The equational theory LC 
is a congruence relation on the terms, which is denoted as =lc- The pure equational theory is defined as 
restricted to the axiom-signature, i.e. to the terms ^ {{Env, Bind}, Li, T^m U ^sind), where 1^ is the set 
of variables of sort S. The combined equational theory is defined on the set of terms ^{S,^, "V^). Note 
that it is a disjoint combination w.r.t. the function symbols, but not w.r.t. the sorts. 

The following facts about the theory LC can easily be verified: 

Lemma 3.1 For the equation theory LC, the following holds in T^{S,'L, ^^): 

• The terms in the LC-axioms are built only from T,i-symbols and variables, and the axioms relate 
two terms of equal sort which must be in Si- 

• For every equation s =lc t, the equality Var{s) = Var{t) holds. 

• The equational theory LC is non-collapsing, i.e, there is no equation of the form x =ix: t, where t 
is not the variable x. 

• IfC\s\ =Lc t and s has a free function symbol as top symbol, then there is a context C' and a term 
s' such that C[s] =ic C'[s'],C' =lc C, s =lc ^ <^fid els'] = t. This follows from general properties 
of combination of equational theories and properties of the theory LC. 

• The equational theory LC has a finitary and decidable unification problem ( seeSSl 

In order to capture binding chains of variable length as they occur in the definition of the 
no-reduction rules (Figure [2]) the syntax construct BCh(A'^i ,A'2) is introduced, where Ni are inte- 
ger variables that can be instantiated with Ni ^ ni, N2 «2> where < «i < «2- An instance 
BCh(?ii,?i2) for ni,n2 > 1 represents the following binding chain: Z7/?i(i(j„|+i,A„,+i(var(y„,))), 
bind{yn^+2,An^+2{var(yn^^l))), . . . ,bind{yn2,An2{var{y„2^i))), where the names y,-,A; are reserved for 
these purposes and are all distinct. The BCh-expressions are permitted only in the env* -notation, like a 
sub-multi-set, and we denote this for example as env*{. . . UBCh(M,A'2) U r). 

Context-classes are required to correctly model the overlappings in L^g^rf. The transformations in 
Figure [T] contain only C-contexts, whereas in Figure[2]there are also J^- and ^-contexts, and the overlap- 
ping also requires surface contexts S- The grammar definition of and 5-contexts (definition 2.3 1 



justifies the replacement of ^-contexts by expressions containing only jl-contexts and BCh-expressions. 
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Thereby some rules of Figure |2] may be split into several rules. The context class C means all contexts 
and S means all contexts where the hole is not in an abstraction. In the term encoding, these translate to 
context variables. The unification algorithm must know how to deal with context variables of classes J?, 
S and C- The partial order on context classes is < 5 < {T. For every almost ground context C it can 
be decided whether C belongs to Si (or S, respectively). We wiU use the facts that equational deduction 
w.r.t. LC does not change the context class of almost ground contexts, and that prefix and suffix contexts 
of almost ground contexts C have the same context class as C (among Si, S and C). 

4 A Unification Algorithm LCSX for Left-Commutativity, Sorts and 
Context- Variables 

We define unification problems and solutions as extension of equational unification (see fSl). 

A unification problem is a pair (F, A), where F = {^i =t\, . . . ,Sn = the terms Si and f,- are of the 
same sort for every / and may also contain BCh-expressions, every context variable is labelled with a 
context class symbol, and A = (Ai,A2) is a constraint consisting of a set of context variables Ai and a 
set A2 of equations and inequations of the form Ni + \= Nj and Ni < Nj for the integer variables N,. The 
intention is that Ai consists of context variables that must not be instantiated by the empty context, and 
that the constraints A2 hold for (j{Ni) after instantiating with a. 

A solution a of (F, A), with F = {si = ti, . . . ,s„ = f„} is a substitution a according to the following 
conditions: i) it instantiates variables by terms, context variables by contexts of the correct context class 
that are nontrivial if contained in Ai, and the integer variables Ni by positive integers according to the 
constraint A2. ii) G{si),a{ti) are almost ground for all /. Itis assumed that the BCh-constructs BCh(«i,«2) 
are expanded into a binding chain as explained above, iii) (j{si) =lc (^{U) for all /. 

A unification problem F is called almost linear, if every context variable occurs at most once and 
every variable of a non-empty sort occurs at most once in the equations. 

Definition 4.1 Let Ylj be the set of left hand sides of reduction rules from Figure^and Il„o the set of 
left hand sides of no-reduction rules from Figure^where the reduction contexts R in (Ibeta) and (lapp) 
are instantiated by the four possibilities for R: A, (letrec Env in A), (letrec y[ = A,Env inA2\yi]), 
(letrec = A,BCh{N\,N2),Env inA[yN^]) with constraint Ni < N2- The meta-variable s in the cp 
rules (that can be either a variable or an abstraction) is instantiated by var[z) and an abstraction Xx.t 
where t denotes a meta-variable for an arbitrary expression. With n^,n^^ we denote the sets where left 
hand sides of rules are encoded as terms. 

We consider the set of unification problems F,- = {S{lT.i) = Ino.j \ ho.j G n'^^^} with Ijj G and S 
is a surface context variable. The sets Tlj and n^,^ are assumed to be variable disjoint, which can be 
achieved by renaming. The initial set Ai of context variables only contains the A2-context from the (cp-e)- 
reductions, and A2 may contain some initial constraints from the rules. The pairs (F,-, A) are called the 
initial L^gg^-forking-problems. 

Note that initial L^e^rf-forking-problems are almost linear, there is at most one BCh-construct, which 
is in the environment of the topmost let-expression, and there are no variables of type Bind. 

Definition 4.2 A final unification problem S of an initial F is a set of equations si =ti,...,Sn = tn, 
such that S = Sgv U S^bV' <^t^d every equation in Sbv of the form x = y where x,y are of sort BV and 
every equation in S^bv of the form x = t, where x is not of sort BV, and the equations in S^bv <^fe in 
DAG-solved form. 
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Given a final unification problem S, the represented solutions <7 could be derived by first instantiating 
the integer variables, expanding the BCh-constructs into binding chains, instantiating all context variables 
and variables that are not of sort BV and then turning the equations into substitutions. 

A final unification problem 5 derived from F satisfies the distinct variable convention (DVC), if 
for every derived solution a, all terms in C7(r) satisfy the DVC. This property is decidable: If ti = ?2 
is the initial problem, then apply the substitution a derived from S to ti. The DVC is violated if the 
following condition holds: Let Mbv be the set of SV-variables occurring in <j(fi). For every BCh- 
construct BCh{Ni,N2) occurring in o{ti) we add the variable y^^ ^ Mbv. If O'(^i) makes two variables 
in Mbv equal, then the DVC is violated, and the corresponding final problem is discarded. 

Example 4.3 Unifying (the first-order encodings of) XxXy.x and Xu.Xv.v, the unification succeeds and 

generates an instance that represents Xx.Xx.x, which does not satisfy' the DVC. Thus a variant of our 
unification can efficiently check alpha-equivalence of lambda-expressions that satisfy the DVC. 

We proceed by describing a unification algorithm starting with initial L„gej-unification problems 
(F,A). It is intended to be complete for all common instances that represent Lngg^-expressions that 
satisfy the DVC, i.e. where all bound variables are distinct and the bound variables are distinct from free 
variables. Final unification problems that lead to expressions that do not satisfy the DVC are discarded. 

Given an initial unification problem F = {^i = /i}; A, the (non-deterministic) unification algorithm 
described below will non-deterministically compute a final unification problem 5 or fail. A finite 
complete set of final unification problems can be attained by gathering all final unification problems in 
the whole tree of all non-deterministic choices. We implicitly use symmetry of = if not stated otherwise. 
We divide F in a solved part S, (a final unification problem), and a still to be solved part P. We usually 
omit A in the notation if it is not changed by the rule. 

Standard unification rules. 

S;{f{si,...,Sn)=f{tl,...,tn)]^P . . ^. . ... , 

Dec — —z—r — -. -. — s If / IS a free function symbol (i.e. / env). 

S; {si =ti,...,s„ = t„}UP 

^, S;{x = t}biP S;{s = s}\£P 

Solve . \ 1 , , c n TVivial *- ^ / 

{x = t}\JS;P S;P 

5; {/(...)= g (...)} 1+) P , „ „ 5; If 5 is final and the DVC is violated w.r.t. 

Fail L/VC^rail ~ . . . . 

Fail Fad the initial problem. 

Note that the occurs-check is not necessary, since P is almost linear and an equation x = t for 
variables x of type BV implies that ? is a variable. 

Solving equations with context variables. The rules for terms with contexts as top symbol using their 
context classes are as follows: The following rule operates on context variables at any position: 

S' P' A 

Empty-C — If X occurs in P and X ^ Ai . 

select one of the following possibilities 

5; P;{X}\J Ai or {X [•] } U 5; {X [•] }P; Ai 

Assume there is an equation X{s) = t, where the top symbol of t is not a context variable and X e Ai. 
Note that the sort of X{s) is Exp. There are the following possibilities: 

Dec-CA S;{X{s)=app{tut2)}^P 



{X^app(X',t2)}US; {X'{s) =ti}UP 
X' is a fresh context variable of the same context class as X. 
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{X^f{ti,X')}\JS;{X'{s)=t2}yjP 
X' is a fresh context variable of the same context class as X (it may only be C or S) and / is a 
function symbol such that / G {let,app}. 

Dgj..CL 5; {X{s)=let{h,t2)}^P 

{X /e?(env*({^7jn^f(x,Z')}Uz),?2)}U5; {env*{{bind{x,X'{s))}Uz) =ti}UP 
If X is of context class 5 or C. X' is a fresh context variable of the same context class as X. 

Dec-Lam 5; = /a«z(.i,.,)} 



{X^lam{ti,X')}^S; {X'{s)=t2}UP 
If X is of class C- X' is a fresh context variable of the class C- 

Fail-Lam {X{s) = lam{H,t2)}^P ^^^^^^ S; {X js) = var{x)} ^ P 

Fail Fail 
If X is of class or 5. 

Given an equation X{s) = 7(f), with X,Y e Ai, let D be the smaller one of the context classes of X,Y. 
Then select one of the following possibilities: 

S; {X(s)=Y(t)}\SP 
Merge-P 'ivy \ jj 



{Y^ZY',X^Z}US; {s = Y'{t)}UP 
Y' is a fresh context variable of the same context class as Y, and Z has context class ®. 

S; {X(s) = Y(t)}\£P 
Merge-FA ' \ \ J \ JS 



{X>-^Z{app{X',Y'{t)),Y>-^Z{app{X'{s),Y'))}\JS; P 
If exactly one of the context classes of X,Y is SI. W.l.o.g. let X be of context class Jl. X',Y' 
are fresh context variables of the same context class as Z,7, respectively, and Z is a fresh context 
variable of context class J^.. 

Merge-FC 

S; {X{s) = Y{t)}[^P 

choose either of the following possibilities 

{X ^ Z{app {X\Y'{t))),Y ^ Z{app {X'(s),Y'))}US; P 

{X^Z{let{env*{{bind{xX)}^,z)J'{t)))J^Z{let{env*{{bind{xX{s}^^ P 
{Xi-^Z(/e;(ew*({Wn^/(;c,Z'),Z7mJ(j,7'(;))}Uz),w)), 
Y <-^Z{let{env*{{bind{xX{s)),bind{y,Y')}Uz),w))}US-, P 
If the context classes of X, 7 are different from Ji. X' ,Y' are fresh context variables of the same context 
class as X, 7, respectively and Z is a fresh context variable of context class 'D. The variables are 
also fresh and of the appropriate sort. 

Rules for Multi-Set Equations. The following additional (non-deterministic) unification rules are suf- 
ficient to solve nontrivial equations of type Env, i.e. proper multi-set-equations, which must be of the 
form env* {L\ U ri ) = env* (L2 U r2), where r\ , r2 are variables or the constant emptyEnv. We will use the 
notation L for sub-lists in e«v* -expressions and the notation Li UL2 for union. In the terms env*{LUt), 
we assume that / is not of the form env{. . .). It is also not of the form Z(. . .) due to the sort assumptions. 
Other free function symbols are disallowed, hence t can only be a variable or the constant emptyEnv. 
The components in the multi-set may be expressions of type Bind, i.e., variables or expressions with 
top symbol bind, or a BCh(. . .)-component that represents several terms of type Bind. We also use the 
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convention that in the conclusions of the rules an empty environment env* ({ } U r) without any bindings 
and just a variable r is identified with r. Note that the lists allow multi-set operations like reorderings. 

Due to the initial encoding of reduction rules, if a BCh(A^i,//2) -construct occurs in a term in P, it 
occurs in an env* -list, hence there is also a binding yN^ = 5 in the env* -list, and the Ust is terminated with 
a variable derived from the environment-variable Env. In equations, the BCh(. . .)-components initially 
appear only on one side, which cannot be changed by the unification. Also the env* -list is an immedi- 
ate sub-term of a top let-expression, which may change after applying unification rules. Due to these 
conditions, we assume that the left term in the equation does not contain BCh(. . .) -components. 

If there is an equation env*{Li U ri) = env*(L2 U r2), then select one of the following possibilities: 

S;{env*{UUn)=env*{L2Ur2)}^P „ ... • . u • u, 

Solve-E 7 7 r TT — - — If n , r2 are variables; Z3 is a fresh variable. 

{n env* (L2 U Z3 ) , r2 env* (Li U Z3 ) } U 5; P 

jj^^g S; {env* (Li U n ) = env* (L2 U } ^ P 

S; {ti =t2,env*{Li\{ti}Un) = env*{L2\{t2}Ur2)}\£P 
If Li and L2 contain binding expressions ti and, ?2- 

Dec-Ch 

S; {env*{LiUn) = env*{BCh{Ni,N2)UL2Ur2)}^P; {Ai,{Ni <N2}UA2) 

select one of the following possibilities 
(?) 5; {ti = bind(yM2,AN2(var{yN^))), 

env*{Li\{ti}Uri)=env*{L2Ur2)}lJP; {AjvJ U Aj, {M + 1 = A^2} U A2 
(//) S; {ti = bind{yN3,AN3{var{yMi))), 

env*(Li\{fi}Uri) =ew*(BCh(A^3,Af2)UL2Ur2)}UP; {A^,} U Ai, {M + 1 = ^3,^3 < A^2} U A2 
(Hi) S; {ti = bind{yN2,AN2ivar(yN^))), 

env*{Li\{ti}Uri) = env*{BCh{Ni,N3)UL2Ur2)}UP; {A^Vj} U Aj, {M < A^3,A^3 + 1 = A^2} U A2 
(iv) S; {ti = bind{yN^,AN^{var{yNi))) , 

env* (Li \ {fi } u n ) = env* (BCh(Afi , A^s) U BCh(Af4 , A^2) U L2 U r2) } U P; 
{An,}UAi,{Ni < N3,N3 + 1 = N4,N4 < A^2} U A2 
Where yN2jyN3,yN4,Aj\/2,A]^^,AN^,N3,N4 are fresh variables of appropriate sort. 

S; {env* (L U f ) = empty Env} W P 

Fail-E — — 

Fad 

If L is nonempty, i.e contains at least one binding or at least one BCh-expression. 

An invariant of the rules that deal with BCh is that the variables A^,- may appear at most twice in F; at 
most twice explicit in A2 and at most once in BCh-expressions. 



4.1 Properties of the LCSX-Unification Algorithm 

Lemma 4.4 For initial problems, the algorithm LCSX terminates. 

Proof. For this we can ignore the rules that change A. 

The following measure is used, which is a lexicographical combination of several component mea- 
sures: 111 is the number of occurrences of let in P; the second component /I2 is the following size- 
measure, where env*(LLJr) has measure 7m + m' + £/i2(f/) + {J.2{r) where m is the number of bind- 
expressions in L and and m' is the number of BCh-expressions in L. 

The critical applications are the guessing rules for equations with top-context variables, and the rules 
for multi-equations. The context variable-guessing either decreases the size or the number of occurrences 
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of let. The multi-equation rules in rule Dec-Ch have to be analyzed. The new constructed bind-term has 
size 5, so the subcases (/) - (///) strictly reduce the size. The subcase {iv) adds 6 to the size due to new 
sub-terms, and removes 7 since t\ is a non-BCh-expression and removed from the multi-set. □ 

Lemma 4.5 The non-deterministic rule-based unification algorithm LCSX is sound and complete in the 
following sense: every computed final unification problem that leads to an expression satisfying the 
DVC represents a set of solutions and every solution of the initial unification problem that represents an 
expression satisfying the DVC is represented by one final system of equations. 

Proof. Soundness can be proved by standard methods, since rules are either instantiations or instantia- 
tions using the theory LC. 

Completeness can be proved, if every rule is shown to be complete, and if there are no stuck unifi- 
cation problems that have solutions. The Solve rules are complete since solved variables (in equations 
of the form x = t) we. just marked as such, i.e. moved to a set of solved equations. Solving equations 
X{s) = f is complete: if f is a variable, then it can be replaced; if f is a proper term of type Exp, then all 
cases are covered by the rules. In the case that the equation is X(5) = Y{t), the rules are also complete, 
and also respect the context classes of X, F. If the equation is 5 = 5, then it will be removed, and if it is 
of the form /(. . .) = /(. . .) then decomposition applies. In the case that the top symbol is env, the rules 
for multi-equations apply, i.e., the rules for env* . Using the properties of the equational theory LC and 
the considerations in |T|), we see that the rules are complete. □ 

Theorem 4.6 The rule-based algorithm LCSX terminates if applied to initial Lneed-forking-problems. 
Thus it decides unifiability of these sets of equations. Since it is sound and complete, and the forking 
possibilities are finite, the algorithm also computes a finite and complete set of final unification problems 
by gathering all possible results. 

Theorem 4.7 The computation of all overlaps between the rules in Figure [7] and left hand sides of 
normal order reductions in Figure |2] can be done using the algorithm LCSX. The unification algorithm 
terminates in all of these cases and computes a finite set of final unification problems and hence all the 
critical pairs w.r.t. our normal order reduction. 

5 Running the Unification Algoritlim LCSX 

Example 5.1 The goal is to compute a complete set of forks for the transformation (cp-e) 

(letrec x = s,Env,z = C[x] in r) — t- (letrec x = s,Env,z = C[s] in r) 

from Figure^ We instantiate the meta-variable s by the expression kw.t and translate the left hand side 
of the rule into the term language, resulting in the following initial forking problem to be solved 

{S{let{env* {{bind{x, lam{w,t)) , bind{z,C{yar{x) ) ) } U Env) ,r)) = l„o.j} ■ 

where Inoj is an encoded left hand side of an no-reduction rule. We pick a single equation from this set: 

S{let{env* {{bind{x, lam{w,t)) , bind{z, C{var{x) ) ) } U Envi) , r) ) 
= let{env*{{bind{x',lam{w',t')),bind{yNt,ANi{var{x')))}UBCh{Ni,N2) UEnv2),A{yN2)) 

which describes the overlaps between the (cp-e) transformation and the normal order (cp-e-c) reduc- 
tion. No we compute one possible final problem via the presented unification algorithm. A nontrivial 
possibility is to choose S = [■] via the Empty-C-rule and then using decomposition for let which leads to 
r = A{yt^^)) and the equation 
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env*{{bind{x, lam{w,t)) , bind{z, C{var{x)))} U Envi) 
= env* {{bind{x' ,lam{w' ,t')),bind{yNi ,Ani ivar{x')))}U BCh{Ni,N2)UEnv2). 
One choice for the next step (via the rule Dec-Ch ) results in the equations: 

bind{z,C{var{x)) = bind{yii^,Aj^^{var{yj^^))), env* {{bind{x,lam{w,t))} U Envi) 
= env*{{bind{x',lam{w',t')),bind{yN, ,Ani (var(x')))} U {BCh{N]_,N?,)VJBCh{N^,N2)^Env2) 

where one binding is taken from the BCh{Ni,N2) -construct and the chain is split around this binding 
into two remaining chains. The two bindings bind{x,lam(w,t)) and bind(x' ,lam{w' ,t')) are unified (via 
Dec-E) and then we solve the equation between the environments (Solve-E) and (after three additional 
Dec-steps, two for bind and one for lam) we arrive at the system 

C{var{x)) = AN^{var{yN^)),z = yN4i^ = x' ,w = w' ,t = t',Env2 = env* {{bind{x,lam{w,t))} U Env^) , 
Envi = env* (^{bind{x' ,lam{w' ,t')),bind(yNi ,ANi(yar{x')))}UBCh{Ni,N3)LIBCh{N4,N2)LlEnv3). 

Next we apply Merge-FA to the first equation, yielding 

C = Z{app{A'f^^{var{yN,)),C')),AN2 = Z{app{A'f^^,C' {var{x)))) 

where Z,A'i^^ are of context class and C' is of context class C- The final representation is: 

S^BV = {S= [■],r = A{yN,),C = Z{...),t = t' ,An, = Z(. . .),Env2 = env*{. . .),Envi = env*{. . .)} 
Sbv = {z = yN4,x = x',w = w'} 

The resulting expression is: 

letrecx' = (Xw'.t'),yNi = ANi[x'],{yi+i = A;+i [3;,-]}^^^^ , 

jiV, =Z[app{A'f^^{var{yN,)),C[x'])],{yi+i = A^+i [y,-]}^^^^,^^! inA^VA^J 

The corresponding fork is given by reducing the expression with (no,cp-e-c) and (cp-e) respectively 

letrec jc' = (Xw' .t'),y!^^ =A/jr|[jc'], 
{yi+i = Ai+i [yi] }^^^ ,yN^ = Z[app{A'^^ [var{yNj )] , C [x'] )] , 
{yi+i =Ai+i[yi]}f^N^,Env2 ±nA[yN2] 

no,cp-e-c 

, ,x , ,-, letrec x' = (Aw'.f'), v«, —An,\x'\, 

letrecx' = {Xw'.t'),yN,=AN,[XW.i'], ^^'j^^ 

{yi+i=Ai+i\yi]}^lj^ yN4^Z[app{A'j^ [var{yN^%C'[x'])], ^1"^)^ / ./m 

' , ,,A,, ^ ' , , yN^^Z[app(Af^^[var{yN^%C[Xy/ X])], 

{yi+l=Ai+i[yi\\'Env2i.nA[yN^\ - ^ • .r 1 

iS fCp-G no ^cp-c-c 

This fork can be closed by the sequence — > ■ < — ■ . Notice that for the determination of all forks 

it is sufficient to compute final systems. The (possibly infinite) set of ground solutions is not required. 

We implemented the presented unification algorithm LCSX in Haskell to compute all forks between 
transformations and no-reductions. The program computes 1214 overlaps for the L„gg^ calculus, and also 
searches for closing reduction sequences. Via this method we were able to close (almosQl all forks. The 
complete sets of forking diagrams for the transformations Uet and cp is in Figure [4] The implementation 
is available at: [http : //www . ki . inf ormatik.uni-fraiikfurt .de/research/dfg-diagram/eii[ 
More informal ton can be found in lITOll . 

'Some simple commuting diagrams for cp reductions are not automatically closed, due to renaming of bound variables. 
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Figure 4: Complet sets of forking diagrams for llet and cp transformations. 



6 Conclusion and Further Work 

We have provided an method using first-order unification with equational theories, sorts, context vari- 
ables and context classes and binding chains of variable length to compute all critical overlaps between 
a set of transformation rules and a set of normal order rules in a call-by-need lambda calculus with 
letrec-environments. Further work is to apply this method to further transformations and also to extend 
the method in order to make it applicable to other program calculi as in ifTSl . where variable- variable 
bindings are present in the rules, and to calculi with data structures and case-expressions. 
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